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Abstract 

The Hurst coefficient H of a stochastic fractal signal is estimated using the function Oj^ A 



N max W> 



Y^i=n X [y(i) ~ Vn{i)\ > where y n {i) is defined as l/ n X)fc=oy(* ~~ n * s ^ ne dimension of 



moving average box and N max is the dimension of the stochastic series. The ability to capture 
scaling properties by be understood by observing that the function C n (i) = y(i) — y n (i) 

generates a sequence of random clusters having power-law probability distribution of the amplitude 
and of the lifetime, with exponents equal to the fractal dimension D of the stochastic series. 
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Power-law distributions are important manifestations of scale invariance as observed in 
fractal, percolating structures and self-organized dynamical systems. The scaling exponents 
of power law statistics are indeed related to the universality class of the system and thus are 
helpful to understand the fundamental processes ruling the dynamics of complex systems. 
The scaling parameters have been practically deployed in fields as different as biology, geo- 
physics, solar physics, social science. When random time records should be analyzed the 
Hurst exponent H, which is related to the fractal dimension of the time series by D = 2 — H, 
is usually evaluated. For example, healthy and sick heart beat rate can be distinguished on 
the basis of the value of H. Financial series with degree of persistence higher than that of the 
price series have been encountered. A number of frequency, time and, even, integrated do- 
main approaches have been thus developed to gain as accurate as possible estimate of these 
exponents. Such procedures generally consist in calculating appropriate statistical functions 
from the whole signal: in the time domain, Detrended Fluctuation Analysis (DFA) and 
Rescaled Range Analysis (R/S) are the most popular scalingjnethods to extract power-law 
con**- exponents fr0m the raodo. s^s M flQtft BBBEaiT 




A generalized variance o\ lA of the noisy signal y{i) with respect to y n (i), defined by: 

N max 



a 



yiA ~ ~n 51 C n (if, (1) 

J "mffit " i=n 



with: 



C n (i) = y(i)-y n (i), (2) 
has been introduced in 9]. In Eq.(JTJ, N max is the size of the series, y n (i) is the moving 
average defined by y n (i) = - Y.k=o ~ ^) anc ^ n * s moving average window. 

The function ctma corresponding to each y n (i) has been calculated for random series y(i) 
with different Hurst exponent H. The values of &ma are plotted as a function of n on log-log 
axes shown in Fig. [J, for a series with N max = 2 19 and 10 < n < 10 4 . The most remarkable 
property of the curves plotted in Fig.£Q is their power-law dependence on n, i.e.: 

oma « n H . (3) 

The structure of the algorithm based on the om a function is analogous to the DFA tech- 
nique, but a higher accuracy has been observed. This may be due to the better smogthing 
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and detrending action of the moving average with respect to the linear or to the polynomial 
fit. Furthermore, the moving average filter dynamically detrends the series. Every time the 
discrete index i increases by a unity, the box window n switches its position accordingly, 
allowing also for online applications of the present technique. In order to gain a deeper 
insight of the function a ma, we will show that C n (i) generates, for each y n (i), a sequence of 
clusters with amplitude and lifetime distributed as power-laws. Furthermore the exponent 
of these power-laws is equal to tte fractal dimension D = 2 — H of the time series. These 
properties allow to understand the ability of oma to estimate the Hurst exponent H of the 
stochastic series. 

Consider thus the sub-set of y(i) corresponding to the region delimited by two consecutive 
intersections between y(i) and y n (i). Let us refer to such regions as cluster. In Fig.j2], one 
of these clusters is shown. For each cluster j, the size lj : 

ic(j + l) 

h= E y(0 (4) 

i=ic(j) 

and the duration Tf. 

Tj = i c (j + l)-i c (j). (5) 

can be defined. In the previous relationships, i c (j) and i c (j + 1) are the values taken by 
the index i in correspondence of two subsequent intersections between y n {i) and y(i). The 
size lj and the duration Tj of the clusters are shown in Fig.Q for a series with N max = 2 19 , 
H = 5.3 and n = 30. 

The probability distribution P{z) of the cluster sizes has been determined by counting 
the number of clusters with length I. Fig. [H] shows the log-log plot of the distribution P(l) 
for series having different H (respectively H = 0.2, H = 0.5 and H = 0.8). The clusters are 
obtained by the intersection of the series with a moving average with a window amplitude 
n = 60. The curves are consistent with a straight line, indicating a power-law behavior: 

p(i) « r° (6) 

The linear behavior over two decades indicates a scaling distribution of clusters. The 
exponent a has been plotted against H can the inset of Fig. [3] . 
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The probability distribution P(t) of the cluster lifetime has been determined by counting 
the number of clusters with lifetime r. Fig. shows the log-log plot of the distribution P(t) 
for series having different H (respectively H = 0.2, H = 0.4 and H = 0.8). The clusters are 
obtained by the intersection of the series with a moving average with a window amplitude 
n = 60. This leads to another line indicating a distribution of lifetimes of the form: 

P(r) « t-P. (7) 

The values of a and /3, for H varying from 0.10 to 0.90, are reported in Table (1). The 
distributions P(l) and P(t) have been calculated for a wide range of values of N max and n 
( 2 18 < N max < 2 21 and 60 < n < 1000), the exponents a and f3 have been found to be 
independent of N max and n. It can be concluded that a and (3 vary as 2 — H, i.e. as the 
fractal dimension D of the noisy signal. 

These results can be understood keeping in mind the box-counting method to estimate 
the fractal dimension of a random signal is recalled (Ch.10 of jl|). The stochastic signal 
is covered with boxes of width br (in time) and of length ba (in amplitude), with r • a the 
minimum box. The box dimension Db is related to the number N{b;a,r) of the boxes 
needed to cover the record by the relationship: N(b; a, r) m b~° B , where Db = 2 — H for 
self-affine records. 

We have demonstrated that the Hurst exponent H of long-range correlated stochastic 
series y(i) can be calculated using the function oma defined by the Eq.((TJ). We have found 
indeed the remarkable result that the function oma varies as a power-law of the amplitude 
n of the moving average box. This work shed more light on the results of the papers 0, |^| 
and reinforce the general idea that a deeper physical insight may add new perspectives to 
the practical methods of financial analysis. 

These results, appeared in very preliminary form in Q], have been here validated in terms 
of the statistical properties of the random clusters tracked by y n (i) over the random time 
series. Furthermore, the power-law behavior of the distributions of cluster size and lifetime, 
just as expected from general arguments about dynamical system, clarifies the intrinsic 
capability of the function oma to capture scaling exponents of the series. Furthermore, the 
exponents a and j3 of Eqs. (|6|7jl are insensitive to the parameters characterizing y n (i) and 
y(i). This robustness is essential to the assessment of the proposed scaling technique. Last 
but not least, the oma algorithm yields in higher accuracy, speed of execution and possibility 



of online application. 
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FIG. 1: Log-log plot of the function <jma vs. the moving average box n. The curves have been 
obtained using the computational algorithm described in the text. The curves refers to artificially- 
generated series having N max = 2 19 and Hurst coefficient H varying between 0.10 and 0.90. A 
power-law dependence (<jma °c n H ) is found. The results are independent of the series dimension 

Nmax- 
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FIG. 2: Stochastic series y n (i) obtained by the Random Midpoint Displacement algorithm with 
H = 0.3. The size of the series is N max = 2 19 . The moving average y n (i), with box dimension 
n = 30, is also shown. The total length of the segment and the time interval between two subsequent 
crossing points represent respectively the size and the duration (lifetime) of the cluster according 
to the definition given in the text (Eas. (|4l5[l ). 
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FIG. 3: Log-log plot of distribution P(l) of the cluster size I for series having different H (respec- 
tively H = 0.2, H = 0.5 and H = 0.8). The clusters are obtained by the intersection of the series 
with a moving average with a window amplitude n = 60. A power-law dependence P(l) ps l~ a has 
been observed. It is worth-mentioning that the exponent a depends on the Hurst exponent of the 
series, but is independent of the series dimension N max and of the moving average box n. 
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FIG. 4: Log-log plot of the distribution P(t) of the cluster lifetime r for series having different 
H (respectively H = 0.2, H = 0.5 and H = 0.8). The clusters are obtained by the intersection of 
the series with a moving average with a window amplitude n = 60. The lifetime of the cluster 
corresponds to the time interval between two consecutive intersection points, as defined by the 
Eq.©. A power-law dependence P(t) = t~@ has been observed. It is worth-mentioning that the 
exponent /3 depends on the Hurst exponent of the series, but is independent of the dimension 
Nmax and of the moving average box n. 
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